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4 


<110> 


APPLICANT 


: Jon S. 


THORSON 


















6 


<120> 


TITLE 


OF INVENTION: MICROMONOSPORA ECHINOSPORA GENES 




7 






ENCODING FOR BIOSYNTHESIS 


OF 
















8 






CALICHEAMICIN 


AND 


SELF -RES I STANCE 


THERETO 










10 


<130> 


FILE REFERENCE: 26 53-40 


















12 


<140> 


CURRENT APPLICATION NUMBER: 09/724,797 








ST 

11 i 


13 


<141> 


CURRENT FILING DATE: 2000- 


11-28 










= K 


15 


<150> 


PRIOR 


APPLICATION 


NUMBER: 


60/111,325 












16 


<151> 


PRIOR 


FILING DATE: 


1998-12-07 
















18 


<160> 


NUMBER OF 


SEQ 


ID NOS: 


95 


















20 


<170> 


SOFTWARE : 


FastSEQ 


for 


Windows 


Version 


4 . 0 










22 


<210> 


SEQ ID NO 


: 1 






















\ 


23 


<211> 


LENGTH: 546 
























24 


<212> 


TYPE: 


DNA 


























25 


<213> 


ORGANISM: 


Bacteria 




















27 


<220> 


FEATURE : 


























28 


<221> 


NAME/KEY : 


CDS 
























29 


<222> 


LOCATION: 


(1) 


..(546) 




















31 


<400> 


SEQUENCE : 


1 
























32 


atg 


act 


cag 


gag 


aag 


acc 


gca 


ccg 


gec 


gcg 


aag 


age 


acg 


acc 


acc 


aag 


33 


Met 


Thr 


Gin 


Glu 


Lys 


Thr 


Ala 


Pro 


Ala 


Ala 


Lys 


Ser 


Thr 


Thr 


Thr 


Lys 


34 


1 










5 










10 










15 




36 


age 


ace 


gee 


gcg 


aag 


aag 


ccg 


aag 


ccc 


ccg 


aac 


tac 


gac 


ccg 


ttc 


gtc 


37 


Ser 


Thr 


Ala 


Ala 


Lys 


Lys 


Pro 


Lys 


Pro 


Pro 


Asn 


Tyr 


Asp 


Pro 


Phe 


Val 


38 










20 










25 










30 






40 


egg 


cac 


age 


gtc 


act 


gtc 


aag 


gec 


gac 


cgc 


aag 


acc 


gec 


ttc 


aag 


acg 


41 


Arg 


His 


Ser 


Val 


Thr 


Val 


Lys 


Ala 


Asp 


Arg 


Lys 


Thr 


Ala 


Phe 


Lys 


Thr 


42 








35 










40 










45 








44 


ttc 


etc 


gaa 


ggc 


ttt 


ccg 


gag 


tgg 


tgg 


ccg 


aac 


aac 


ttc 


cgc 


acc 


acc 


45 


Phe 


Leu 


Glu 


Gly 


Phe 


Pro 


Glu 


Trp 


Trp 


Pro 


Asn 


Asn 


Phe 


Arg 


Thr 


Thr 


46 




50 










55 










60 










48 


aag 


gtc 


ggg 


gec 


ccg 


ctg 


ggc 


gtc 


gac 


aag 


aag 


ggc 


ggc 


cgc 


tgg 


tac 


49 


Lys 


Val 


Gly 


Ala 


Pro 


Leu 


Gly 


Val 


Asp 


Lys 


Lys 


Gly 


Gly Arg 


Trp 


Tyr 


50 


65 












70 










75 










80 


52 


gag 


ate 


gac 


gag 


cag 


ggc 


gag 


gag 


cac 


acc 


ttc 


ggc 


ctg 


ate 


egg 


aag 


53 


Glu 


He 


Asp 


Glu 


Gin 


Gly 


Glu 


Glu 


His 


Thr 


Phe 


Gly 


Leu 


He 


Arg 


Lys 


54 












85 










90 










95 




56 


gtg 


gac 


gag 


ccg 


gac 


acg 


ctg 


gtc 


ate 


ggc 


tgg 


egg 


etc 


aac 


ggc 


ttc 


57 


Val 


Asp 


Glu 


Pro 


Asp 


Thr 


Leu 


Val 


He 


Gly 


Trp 


Arg 


Leu 


Asn 


Gly 


Phe 


58 










100 










105 










110 






60 


ggc 


egg 


ate 


gac 


ccg 


gac 


aac 


teg 


age 


gag 


ttc 


acc 


gtg 


acc 


ttc 


gtg 


61 


Gly 


Arg 


He 


Asp 


Pro 


Asp 


Asn 


Ser 


Ser 


Glu 


Phe 


Thr 


Val 


Thr 


Phe 


Val 


62 








115 










120 










125 








64 


gec 


gac 


ggc 


cag 


aag 


aag 


acc 


egg 


gtg 


gac 


gtc 


gag 


cac 


acc 


cac 


ttc 


65 


Ala 


Asp 


Gly 


Gin 


Lys 


Lys 


Thr 


Arg 


Val 


Asp 


Val 


Glu 


His 


Thr 


His 


Phe 


66 




130 










135 










140 










68 


gac 


egg 


atg 


ggc 


acc 


aag 


cac 


gee 


aag 


egg 


gtc 


cgc 


aac 


ggc 


atg 


gac 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 
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69 Asp Arg Met Gly 


Thr 


Lys His 


Ala 


Lys 


Arg 


Val 


Arg 


Asn 


Gly 


Met 


Asp 


70 145 




150 








155 










160 


72 aag ggc tgg ccg 


acg 


ate etc 


cag 


teg 


ttc 


cag 


gac 


aag 


ate 


gac 


gag 


73 Lys Gly Trp Pro 


Thr 


He Leu 


Gin 


Ser 


Phe 


Gin 


Asp 


Lys 


He 


Asp 


Glu 


74 


165 








170 










175 




76 gaa ggg gcg aag 


aag 


tga 




















77 Glu Gly Ala Lys 


Lys 


* 




















78 180 
























81 <210> SEQ ID NO: 


2 






















82 <211> LENGTH: 181 






















83 <212> TYPE: PRT 
























84 <213> ORGANISM: 


Bacteria 




















86 <400> SEQUENCE: 


2 






















87 Met Thr Gin Glu 


Lys 


Thr Ala 


Pro 


Ala 


Ala 


Lys 


Ser 


Thr 


Thr 


Thr 


Lys 


88 1 


5 








10 










15 




89 Ser Thr Ala Ala 


Lys 


Lys Pro 


Lys 


Pro 


Pro 


Asn 


Tyr 


Asp 


Pro 


Phe 


Val 


90 20 








25 










30 






91 Arg His Ser Val 


Thr 


Val Lys 


Ala 


Asp 


Arg 


Lys 


Thr 


Ala 


Phe 


Lys 


Thr 


92 35 






40 










45 








93 Phe Leu Glu Gly 


Phe 


Pro Glu 


Trp 


Trp 


Pro 


Asn 


Asn 


Phe 


Arg 


Thr 


Thr 


94 50 • 




55 










60 










95 Lys Val Gly Ala 


Pro 


Leu Gly Val 


Asp 


Lys 


Lys 


Gly 


Gly 


Arg 


Trp 


Tyr 


96 65 




70 








75 










80 


97 Glu He Asp Glu 


Gin 


Gly Glu 


Glu 


His 


Thr 


Phe 


Gly 


Leu 


lie 


Arg 


Lys 


98 


85 








90 










95 




99 Val Asp Glu Pro 


Asp 


Thr Leu 


Val 


He 


Gly 


Trp 


Arg 


Leu 


Asn 


Gly 


Phe 


100 100 








105 










110 






101 Gly Arg He Asp 


Pro 


Asp Asn 


Ser 


Ser 


Glu 


Phe 


Thr 


Val 


Thr 


Phe 


Val 


102 115 






120 










125 








103 Ala Asp Gly Gin 


Lys 


Lys Thr Arg 


Val 


Asp 


Val 


Glu 


His 


Thr 


His 


Phe 


104 130 




135 










140 










105 Asp Arg Met Gly 


Thr 


Lys His 


Ala 


Lys 


Arg 


Val 


Arg 


Asn 


Gly 


Met Asp 


106 145 




150 








155 










160 


107 Lys Gly Trp Pro 


Thr 


He Leu 


Gin 


Ser 


Phe 


Gin 


Asp 


Lys 


He 


Asp 


Glu 


108 


165 








170 










175 




109 Glu Gly Ala Lys 


Lys 






















110 180 
























113 <210> SEQ ID NO 


: 3 






















114 <211> LENGTH: 1155 






















115 <212> TYPE: DNA 
























116 <213> ORGANISM: 


Bacteria 




















118 <220> FEATURE: 
























119 <221> NAME/KEY: 


CDS 






















120 <222> LOCATION: 


(1) 


...(1155) 


















122 <400> SEQUENCE: 


3 






















123 atg gca act age 


gag 


agg ggt 


gtc 


atg 


ate 


ccg 


ctg 


tec 


aag 


gtc 


gec 


124 Met Ala Thr Ser 


Glu 


Arg Gly Val 


Met 


He 


Pro 


Leu 


Ser 


Lys 


Val 


Ala 


125 1 


5 








10 










15 




127 atg tct ccg gac 


gtc 


age acc 


cgc 


gtc 


tec 


gec 


gtc 


ctg 


age 


agt 


ggc 
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128 


Met 


Ser 


Pro 


Asp 


Val 


Ser 


Thr 


Arg 


Val 


Ser 


Ala 


Val 


Leu 


Ser 


Ser 


Gly 




129 








20 










25 










30 








131 


egg 


ctg 


gag 


cac 


ggg 


ccg 


acc 


gtc 


gec 


gag 


tac 


gag 


gcg 


gee 


gtg 


ggc 


144 


132 


Arg 


Leu 


Glu 


His 


Gly 


Pro 


Thr 


Val 


Ala 


Glu 


Tyr 


Glu 


Ala 


Ala 


Val 


Gly 




133 






35 










40 










45 










135 


agt 


cgt 


ate 


ggc 


aac 


ccc 


egg 


gtg 


gtc 


teg 


gtc 


aac 


tgc 


ggc 


acg 


gee 


192 


136 


Ser 


Arg 


He 


Gly 


Asn 


Pro 


Arg 


Val 


Val 


Ser 


Val 


Asn 


Cys 


Gly 


Thr 


Ala 




137 




50 










55 










60 












139 


ggg 


etc 


cac 


ctg 


gcg 


ctg 


age 


etc 


gec 


gcg 


egg 


ccg 


ggg 


gec 


ggc 


gag 


240 


140 


Gly 


Leu 


His 


Leu 


Ala 


Leu 


Ser 


Leu 


Ala 


Ala- 


Arg 


Pro 


Gly 


Ala 


Gly 


Glu 




141 


65 










70 










75 










80 




143 


teg 


gag 


cac 


gac 


ggc 


ccg 


ggc 


gag 


gtg 


etc 


acc 


acg 


ccg 


ctg 


acc 


ttc 


288 


144 


Ser 


Glu 


His 


Asp 


Gly 


Pro 


Gly 


Glu 


Val 


Leu 


Thr 


Thr 


Pro 


Leu 


Thr 


Phe 




145 










85 










90 










95 






147 


gag 


ggc 


acg 


aac 


tgg 


ccg 


ate 


etc 


gec 


aac 


ggg 


ctg 


cgc 


ate 


egg 


tgg 


336 


148 


Glu 


Gly 


Thr 


Asn 


Trp 


Pro 


He 


Leu 


Ala 


Asn 


Gly 


Leu 


Arg 


He 


Arg 


Trp 




149 








100 










105 










110 








151 


gtg 


gac 


gtc 


gac 


ccg 


gec 


acc 


etc 


aac 


atg 


gac 


etc 


gac 


gac 


ctg 


gec 


384 


152 


Val 


Asp 


Val 


Asp 


Pro 


Ala 


Thr 


Leu 


Asn 


Met 


Asp 


Leu 


Asp 


Asp 


Leu 


Ala 




153 






115 










120 










125 










155 


gcg 


aag 


ate 


teg 


ccc 


gec 


acc 


egg 


gee 


ate 


gtg 


gtg 


gtc 


cac 


tgg 


etc 


432 


156 


Ala 


Lys 


He 


Ser 


Pro 


Ala 


Thr 


Arg 


Ala 


He 


Val 


Val 


Val 


His 


Trp 


Leu 




157 




130 










135 










140 












159 


ggc 


tac 


ccg 


gtg 


gac 


etc 


aac 


egg 


ctg 


cgc 


gee 


gtc 


gtg 


gac 


egg 


gec 


480 


160 


Gly 


Tyr 


Pro 


Val 


Asp 


Leu 


Asn 


Arg 


Leu 


Arg 


Ala 


Val 


Val 


Asp 


Arg 


Ala 




161 


145 










150 










155 










160 




163 


acg 


gcg 


gga 


tac 


gac 


cgc 


cgc 


ccg 


ctg 


gtc 


gtg 


gag 


gac 


tgc 


gcg 


cag 


528 


164 


Thr 


Ala 


Gly 


Tyr 


Asp 


Arg 


Arg 


Pro 


Leu 


Val 


Val 


Glu 


Asp 


Cys 


Ala 


Gin 




165 










165 










170 










175 






167 


gcg 


tgg 


ggc 


gee 


ace 


tac 


egg 


ggc 


gcg 


ccg 


ctg 


ggc 


acg 


cac 


ggc 


aac 


576 


168 


Ala 


Trp 


Gly Ala 


Thr 


Tyr 


Arg 


Gly 


Ala 


Pro 


Leu 


Gly 


Thr 


His 


Gly 


Asn 




169 








180 










185 










190 








171 


gtc 


tgc 


gtg 


tac 


age 


acc 


ggc 


gcg 


ate 


aag 


ate 


ctg 


acg 


acc 


ggc 


age 


624 


172 


Val 


Cys 


Val 


Tyr 


Ser 


Thr 


Gly 


Ala 


He 


Lys 


He 


Leu 


Thr 


Thr 


Gly 


Ser 




173 






195 










200 










205 










175 


ggc 


ggc 


ttc 


gtc 


gtg 


ctg 


ccc 


gac 


gac 


gac 


ctg 


tac 


gac 


egg 


etc 


egg 


672 


176 


Gly 


Gly 


Phe 


Val 


Val 


Leu 


Pro 


Asp 


Asp 


Asp 


Leu 


Tyr 


Asp 


Arg 


Leu 


Arg 




177 




210 










215 










220 












179 


ctg 


cgc 


cgc 


tgg 


etc 


ggc 


ate 


gag 


egg 


gcg 


teg 


gac 


egg 


ate 


acc 


ggc 


720 


180 


Leu 


Arg 


Arg 


Trp 


Leu 


Gly 


lie 


Glu 


Arg 


Ala 


Ser 


Asp 


Arg 


He 


Thr 


Gly 




181 


225 










230 










235 










240 




183 


gac 


tac 


gac 


gtc 


gec 


gag 


tgg 


ggc 


tac 


egg 


ttc 


ate 


etc 


aac 


gag 


ate 


768 


184 


Asp 


Tyr 


Asp 


Val 


Ala 


Glu 


Trp 


Gly 


Tyr 


Arg 


Phe 


He 


Leu 


Asn 


Glu 


He 




185 










245 










250 










255 






187 


ggc 


ggg 


gcg 


ate 


ggc 


ctg 


tec 


aac 


ctg 


gaa 


cgc 


gtc 


gac 


gag 


ctg 


ctg 


816 


188 


Gly 


Gly Ala 


He 


Gly 


Leu 


Ser 


Asn 


Leu 


Glu 


Arg 


Val 


Asp 


Glu 


Leu 


Leu 




189 








260 










265 










270 








191 


cgc 


egg 


cac 


egg 


gag 


aac 


gec 


gcg 


ttc 


tac 


gac 


aag 


gaa 


ctg 


gec 


ggc 


864 


192 


Arg 


Arg 


His 


Arg 


Glu 


Asn 


Ala 


Ala 


Phe 


Tyr 


Asp 


Lys 


Glu 


Leu 


Ala 


Gly 
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193 






275 










280 










285 








195 


ate 


gac 


ggc 


gtc 


gag 


cag 


ace 


gag 


egg 


gee 


gac 


gac 


egg 


gag 


ccc 


gcg 


196 


He 


Asp 


Gly 


Val 


Glu 


Gin 


Thr 


Glu 


Arg 


Ala 


Asp 


Asp 


Arg 


Glu 


Pro 


Ala 


197 




290 










295 










300 










199 


ttc 


tgg 


atg 


tac 


ccg 


ctg 


aag 


gtc 


cgc 


gac 


cgt 


ccc 


gec 


ttc 


atg 


cgc 


200 


Phe 


Trp 


Met 


Tyr 


Pro 


Leu 


Lys 


Val 


Arg 


Asp 


Arg 


Pro 


Ala 


Phe 


Met 


Arg 


201 


305 










310 










315 










320 


203 


egg 


ctg 


etc 


gac 


gee 


ggc 


ate 


gee 


acc 


age 


gtc 


gtg 


teg 


cgc 


cgc 


aac 


204 


Arg 


Leu 


Leu 


Asp 


Ala 


Gly 


He 


Ala 


Thr 


Ser 


Val 


Val 


Ser 


Arg 


Arg 


Asn 


205 










325 










330 










335 




207 


gac 


gcg 


cac 


age 


tgc 


gtc 


gcg 


teg 


gee 


cgc 


acc 


acc 


ctg 


ccc 


ggg 


ctg 


208 


Asp 


Ala 


His 


Ser 


Cys 


Val 


Ala 


Ser 


Ala 


Arg 


Thr 


Thr 


Leu 


Pro 


Gly 


Leu 


209 








340 










345 










350 






211 


gac 


egg 


gtg 


gcg 


gac 


cgc 


gtg 


gtc 


cac 


ate 


ccg 


gtg 


ggc 


tgg 


tgg 


etc 


212 


Asp 


Arg 


Val 


Ala 


Asp 


Arg 


Val 


Val 


His 


He 


Pro 


Val 


Gly 


Trp 


Trp 


Leu 


213 






355 










360 










365 








215 


ace 


gag 


gac 


gac 


cgc 


tec 


cac 


gtc 


gtc 


gaa 


acg 


ate 


aag 


tec 


ggc 


tgg 


216 


Thr 


Glu 


Asp 


Asp 


Arg 


Ser 


His 


Val 


Val 


Glu 


Thr 


He 


Lys 


Ser 


Gly 


Trp 


217 




370 










375 










380 










219 


tga 
































220 


* 
































224 


<210> SEQ ID NO 


4 
























225 


<211> LENGTH: 384 
























226 


<212> TYPE: 


PRT 


























227 


<213> ORGANISM: 


Bacteria 




















229 


<400> SEQUENCE: 


4 
























230 


Met 


Ala 


Thr 


Ser 


Glu 


Arg 


Gly 


Val 


Met 


He 


Pro 


Leu 


Ser 


Lys 


Val 


Ala 


231 


1 








5 










10 










15 




232 


Met 


Ser 


Pro 


Asp 


Val 


Ser 


Thr 


Arg 


Val 


Ser 


Ala 


Val 


Leu 


Ser 


Ser 


Gly 


233 








20 










25 










30 






234 


Arg 


Leu 


Glu 


His 


Gly 


Pro 


Thr 


Val 


Ala 


Glu 


Tyr 


Glu 


Ala 


Ala 


Val 


Gly 


235 






35 










40 










45 








236 


Ser 


Arg 


He 


Gly 


Asn 


Pro 


Arg 


Val 


Val 


Ser 


Val 


Asn 


Cys 


Gly 


Thr 


Ala 


237 




50 










55 










60 










238 


Gly 


Leu 


His 


Leu 


Ala 


Leu 


Ser 


Leu 


Ala 


Ala 


Arg 


Pro 


Gly 


Ala 


Gly 


Glu 


239 


65 










70 










75 










80 


240 


Ser 


Glu 


His 


Asp 


Gly 


Pro 


Gly 


Glu 


Val 


Leu 


Thr 


Thr 


Pro 


Leu 


Thr 


Phe 


241 










85 










90 










95 




242 


Glu 


Gly 


Thr 


Asn 


Trp 


Pro 


He 


Leu 


Ala 


Asn 


Gly 


Leu 


Arg 


lie 


Arg 


Trp 


243 








100 










105 










110 






244 


Val 


Asp 


Val 


Asp 


Pro 


Ala 


Thr 


Leu 


Asn 


Met 


Asp 


Leu 


Asp 


Asp 


Leu 


Ala 


245 






115 










120 










125 








246 


Ala 


Lys 


He 


Ser 


Pro 


Ala 


Thr 


Arg 


Ala 


He 


Val 


Val 


Val 


His 


Trp 


Leu 


247 




130 










135 










140 










248 


Gly 


Tyr 


Pro 


Val 


Asp 


Leu 


Asn 


Arg 


Leu 


Arg 


Ala 


Val 


Val 


Asp 


Arg 


Ala 


249 


145 










150 










155 










160 


250 


Thr 


Ala 


Gly 


Tyr 


Asp 


Arg 


Arg 


Pro 


Leu 


Val 


Val 


Glu 


Asp 


Cys 


Ala 


Gin 


251 










165 










170 










175 




252 


Ala 


Trp 


Gly 


Ala 


Thr 


Tyr 


Arg 


Gly 


Ala 


Pro 


Leu 


Gly 


Thr 


His 


Gly 


Asn 



912 



960 



1008 



1056 



1104 



1152 



1155 
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253 








180 










185 










190 






254 


Val 


Cys 


Val 


Tyr 


Ser 


Thr 


Gly 


Ala 


He 


Lys 


He 


Leu 


Thr 


Thr Gly 


Ser 


255 






195 










200 










205 








256 


Gly 


Gly 


Phe 


Val 


Val 


Leu 


Pro 


Asp 


Asp 


Asp 


Leu 


Tyr 


Asp 


Arg 


Leu 


Arg 


257 




210 










215 










220 










258 


Leu 


Arg 


Arg 


Trp 


Leu 


Gly 


He 


Glu 


Arg 


Ala 


Ser 


Asp 


Arg 


He 


Thr 


Gly 


259 


225 










230 










235 










240 


260 


Asp 


Tyr 


Asp 


Val 


Ala 


Glu 


Trp 


Gly 


Tyr 


Arg 


Phe 


He 


Leu 


Asn 


Glu 


He 


261 










245 










250 










255 




262 


Gly Gly 


Ala 


He 


Gly 


Leu 


Ser 


Asn 


Leu 


Glu 


Arg 


Val 


Asp 


Glu 


Leu 


Leu 


263 








260 










265 










270 






264 


Arg 


Arg 


His 


Arg 


Glu 


Asn 


Ala 


Ala 


Phe 


Tyr 


Asp 


Lys 


Glu 


Leu 


Ala 


Gly 


265 






275 










280 










285 








266 


He 


Asp 


Gly Val 


Glu 


Gin 


Thr 


Glu 


Arg 


Ala 


Asp 


Asp 


Arg 


Glu 


Pro 


Ala 


267 




290 










295 










300 










268 


Phe 


Trp 


Met 


Tyr 


Pro 


Leu 


Lys 


Val 


Arg 


Asp 


Arg 


Pro 


Ala 


Phe 


Met 


Arg 


269 


305 










310 










315 










320 


270 


Arg 


Leu 


Leu 


Asp 


Ala 


Gly 


He 


Ala 


Thr 


Ser 


Val 


Val 


Ser Arg 


Arg 


Asn 


271 










325 










330 










335 




272 


Asp 


Ala 


His 


Ser 


Cys 


Val 


Ala 


Ser 


Ala 


Arg 


Thr 


Thr 


Leu 


Pro 


Gly 


Leu 


273 








340 










345 










350 






274 


Asp 


Arg 


Val 


Ala 


Asp 


Arg 


Val 


Val 


His 


He 


Pro 


Val 


Gly 


Trp 


Trp 


Leu 


275 






355 










360 










365 








276 


Thr 


Glu 


Asp 


Asp 


Arg 


Ser 


His 


Val 


Val 


Glu 


Thr 


He 


Lys 


Ser Gly 


Trp 


277 




370 










375 










380 











280 <210> SEQ ID NO: 5 

281 <211> LENGTH: 990 

282 <212> TYPE: DNA 

283 <213> ORGANISM: Bacteria 
2 85 <220> FEATURE: 

286 <221> NAME/KEY: CDS 

287 <222> LOCATION: (1)...(990) 

288 <223> OTHER INFORMATION : biosynthetic gene 
290 <400> SEQUENCE: 5 



291 


gtg 


ccc 


aga 


tec 


ctg 


gtc 


acc 


ggc 


ggc 


ttc 


ggc 


ttc 


gtc 


ggc 


agt 


cac 


48 


292 


Val 


Pro 


Arg 


Ser 


Leu 


Val 


Thr Gly Gly 


Phe 


Gly 


Phe 


Val 


Gly 


Ser 


His 




293 


1 








5 










10 










15 






295 


gtc 


gtc 


gaa 


egg 


ctg 


gtc 


cgc 


egg 


ggt 


gac 


gag 


gtc 


gtc 


gtc 


tac 


gac 


96 


296 


Val 


Val 


Glu 


Arg 


Leu 


Val 


Arg 


Arg Gly Asp 


Glu 


Val 


Val 


Val 


Tyr 


Asp 




297 








20 










25 










30 








299 


etc 


gec 


gac 


ccg 


ccg 


ccc 


gac 


ctg 


gag 


cac 


ccg 


ccg 


ggc 


gcg 


ate 


egg 


144 


300 


Leu 


Ala 


Asp 


Pro 


Pro 


Pro 


Asp 


Leu 


Glu 


His 


Pro 


Pro Gly Ala 


He 


Arg 




301 






35 










40 










45 










303 


cac 


gtc 


cgc 


ggc 


gac 


gtc 


egg 


gac 


gec 


gac 


ggg 


ctg 


gcg 


gee 


gec 


gec 


192 


304 


His 


Val 


Arg 


Gly 


Asp 


Val 


Arg 


Asp 


Ala 


Asp 


Gly 


Leu 


Ala 


Ala 


Ala 


Ala 




305 




50 










55 










60 












307 


acc 


ggc 


gtg 


gac 


gag 


gtc 


tac 


cac 


etc 


gcg 


gcg 


gtc 


gtc 


ggc 


gtc 


gac 


240 


308 


Thr 


Gly 


Val 


Asp 


Glu 


Val 


Tyr 


His 


Leu 


Ala 


Ala 


Val Val Gly Val 


Asp 




309 


65 










70 










75 










80 
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L:649 M:341 W 
L:650 M:341 W 
L:779 M:341 W 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/724 , 797 
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TIME: 15:08:35 



Input Set : A:\2653-40 Sequence Listing.txt 
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(46) "n" or "Xaa" used, for SEQ ID# : 9 
(46) "n" 'or "Xaa" used, for SEQ ID# : 9 
(46) "n" or "Xaa" used, for SEQ ID#:10 
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